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Abstract. We consider the eigenvalues and eigenvectors of finite, low rank perturba- 
tions of random matrices. Specifically, we prove almost sure convergence of the extreme 
eigenvalues and appropriate projections of the corresponding eigenvectors of the per- 
turbed matrix for additive and multiplicative perturbation models. 

The limiting non-random value is shown to depend explicitly on the limiting eigenvalue 
distribution of the unperturbed random matrix and the assumed perturbation model via 
integral transforms that correspond to very well known objects in free probability theory 
that linearize non-commutative free additive and multiplicative convolution. Further- 
more, we uncover a phase transition phenomenon whereby the large matrix limit of the 
extreme eigenvalues of the perturbed matrix differs from that of the original matrix if 
and only if the eigenvalues of the perturbing matrix are above a certain critical threshold. 
Square root decay of the eigenvalue density at the edge is sufficient to ensure that this 
threshold is finite. This critical threshold is intimately related to the same aforemen- 
tioned integral transforms and our proof techniques bring this connection and the origin 
of the phase transition into focus. Consequently, our results extend the class of 'spiked' 
random matrix models about which such predictions (called the BBP phase transition) 
can be made well beyond the Wigner, Wishart and Jacobi random ensembles found in 
the literature. We examine the impact of this eigenvalue phase transition on the associ- 
ated eigenvectors and observe an analogous phase transition in the eigenvectors. Various 
extensions of our results to the problem of non-extreme eigenvalues are discussed. 



1. Introduction 

Let X n beannxn symmetric (or Hermitian) matrix with eigenvalues Xi(X n ), . . . , X n (X n ) 
and P n be an n x n symmetric (or Hermitian) matrix with rank r < n and non-zero eigen- 
values 9i, . . . , 9 r . A fundamental question in matrix analysis is the following [I3J |2j: 

How are the eigenvalues and eigenvectors of X n + P n related to the eigen- 
values and eigenvectors of X n and P n l 
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When X n and P n are diagonalized by the same eigenvectors, we have Xi(X n + P n ) = 
Xj(X n )+\k(P n ) for appropriate choice of indices i,j, k E {1, . . . , n}. In the general setting, 
however, the answer is complicated by the fact that the eigenvalues and eigenvectors of 
their sum depend on the relationship between the eigenspaces of the individual matrices. 

In this scenario, one can use Weyl's interlacing inequalities and Horn inequalities [22\ 
to obtain coarse bounds for the eigenvalues of the sum in terms of the eigenvalues of 
X n . When the norm of P n is small relative to the norm of X n , tools from perturbation 
theory (see [221 Chapter 6] or [36]) can be employed to improve the characterization of the 
bounded set in which the eigenvalues of the sum must lie. Exploiting any special structure 
in the matrices allows us to refine these bounds [23] but this is pretty much as far as the 
theory goes. Instead of exact answers we must resort to a system of coupled inequalities. 
Describing the behavior of the eigenvectors of the sum is even more complicated. 

Surprisingly, adding some randomness to the eigenspaces permits further analytical 
progress. Specifically, if the eigenspaces are assumed to be "in generic position with re- 
spect to each other", then in place of eigenvalue bounds we have simple, exact answers 
that are to be interpreted probabilistically. These results bring into focus a phase transi- 
tion phenomenon of the kind illustrated in Figured] for the eigenvalues and eigenvectors 
of X n + P n and X n x (J n + P„). A precise statement of the results may be found in Section 

El 

Examining the structure of the analytical expression for the critical values 9 C and p in 
Figure [1] reveals a common underlying theme in the additive and multiplicative perturba- 
tion settings. The critical values 9 C and p in Figure [H are related to integral transforms of 
the limiting eigenvalue distribution px of X n . It turns out that the integral transforms 
that emerge in the respective additive and multiplicative cases are deeply related to very 
well known objects in free probability theory J37J EU E] that linearize free additive and 
multiplicative convolutions respectively. In a forthcoming paper [12], we consider the 
analogue of the problem for the extreme singular values of finite rank deformations of 
rectangular random matrices. There too, a phase transition occurs at a threshold deter- 
mined by an integral transform which plays an analogous role in the computation of the 
rectangular additive free convolution [7J EJ |9] . The emergence of these transforms in the 
context of the study of the extreme or isolated eigenvalue behavior should be of indepen- 
dent interest to free probabilists. In doing so, we extend the results found in the literature 
about the so-called BBP phase transition (named after Baik, Ben Arous, Peche because 
of their seminal paper [1]) for the eigenvalue phase transition in such finite, low rank 
perturbation models well beyond the Wigner [311 E31 ESI QU E] , Wishart U U E3 Effl ES] 
and Jacobi settings [27]. In our situation, the distribution p x in Figure [T] can be any 
probability measure. Consequently, the aforementioned results in the literature can be 
rederived rather simply using the formulas in Section El by substituting px with the semi- 
circle measure [38] (for Wigner matrices), the Marcenko-Pastur measure [26] (for Wishart 
matrices) or the free Jacobi measure (for Jacobi matrices p2]). Concrete computations 
are presented in Section [31 

The development of the eigenvector aspect is another contribution that we would like 
to highlight. Generally speaking, the eigenvector question has received less attention 
in random matrix theory and in free probability theory. A notable exception is the 
recent body of work on the eigenvectors of spiked Wishart matrices [301 E31 EB] which 
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(c) Largest eigenvalue p = b in blue when (d) Associated eigenvector 

6 <9 C when 9 < 6 C 



Figure 1. Assume that the limiting eigenvalue distribution of X n is fix 
with largest eigenvalue b. Consider the matrix P n := 9uu* with rank r = 1 
and largest eigenvalue 9{> 0, say). The vector u is an n x 1 vector chosen 
uniformly at random from the unit n-sphere. The largest eigenvalue of 
X n + P n will differ from b if and only if 9 is greater than some critical 
value 9 C . In this event, the largest eigenvalue will be concentrated around 
p with high probability as in (a). The associated eigenvector u will, with 
high probability, lie on a cone around u as in (b). When 9 < 9 C , a phase 
transition occurs so that with high probability, the largest eigenvalue of 
the sum will equal b as in (c) and the corresponding eigenvector will be 
uniformly distributed on the unit sphere as in (d). 

corresponds to fix being the Marcenko-Pastur measure. In this paper, we extend their 
results for multiplicative models of the kind (J + -P„) 1 / 2 X„(/ + P n ) 1 / 2 to the setting where 
fix is an arbitrary probability measure and obtain new results for the eigenvectors for 
additive models of the form X n + P n . 

Our proofs rely on the derivation of master equation representations of the eigenvalues 
and eigenvectors of the perturbed matrix and the subsequent application of concentration 
inequalities for random vectors uniformly distributed on high dimensional unit spheres 
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(such as the ones appearing in [El [20]) to these implicit master equation representations. 
Consequently, our technique is simpler, more general and brings into focus the source of 
the phase transition phenomenon. The underlying methods can and have been adapted 
to study the extreme singular values and singular vectors of deformations of rectangular 
random matrices, as well as the fluctuations [10] and the large deviations [11] of our 
model. 

The paper is organized as follows. In Section [21 we state the main results and present 
the integral transforms alluded to above. Section [3] presents some examples. An outline 
of the proofs is presented in Section |H Exact master equation representations of the 
eigenvalues and the eigenvectors of the perturbed matrices are derived in Section and 
utilized in Section E] to prove the main results. Technical results needed in these proofs 
have been relegated to the Appendix. 



2. Main results 

2.1. Definitions and hypotheses. Let X n be an nxn symmetric (or Hermitian) random 
matrix whose ordered eigenvalues we denote by \i(X n ) > • • • > X n (X n ). Let nx n be the 
empirical eigenvalue distribution, i.e., the probability measure defined as 

1 n 

Assume that the probability measure \xx n converges almost surely weakly, as n — > oo, to 
a non-random compactly supported probability measure fix ■ Let a and b be, respectively, 
the infimum and supremum of the support of fix- We suppose the smallest and largest 
eigenvalue of X n converge almost surely to a and b. 

For a given r > 1, let d\ > • • • > 9 r be deterministic non-zero real numbers, chosen 
independently of n. For every n, let P n be an n x n symmetric (or Hermitian) random 
matrix having rank r with its r non-zero eigenvalues equal to 9\, . . . , 6 r . Let the index 
s £ {0, . . . , r} be defined such that Q x > ■ ■ ■ > 9 S > > 9 s+ i > ■ ■ ■ > 6 r . 

Recall that a symmetric (or Hermitian) random matrix is said to be orthogonally in- 
variant (or unitarily invariant) if its distribution is invariant under the action of the 
orthogonal (or unitary) group under conjugation. 

We suppose that X n and P n are independent and that either X n or P n is orthogonally 
(or unitarily) invariant. 

2.2. Notation. Throughout this paper, for / a function and c 6 t, we set 

f(c + ) :=\im f(z); f(c~) := lim/(z), 

z\,c zjc 

we also let denote almost sure convergence. The ordered eigenvalues of an n x n 
Hermitian matrix M will be denoted by Ai(M) > ■ • • > X n (M). Lastly, for a subspace 
F of a Euclidian space E and a vector x £ E, we denote the norm of the orthogonal 
projection of x onto F by (x, F). 
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2.3. Extreme eigenvalues and eigenvectors under additive perturbations. Con- 
sider the rank r additive perturbation of the random matrix X n given by 

X n X n -\- P n . 

Theorem 2.1 (Eigenvalue phase transition). The extreme eigenvalues of X n exhibit the 
following behavior as n — > oo. We have that for each 1 < % < s, 



^i{X n ) 



G-^im ife i >i/G lix (b+), 

otherwise, 



while for each fixed i > s, Xi(X n ) b. 

Similarly, for the smallest eigenvalues, we have that for each < j < r — s, 



^n-j{X n ) 



a otherwise, 



while for each fixed j > r — s, X n ^j(X n ) 
Here, 



a.s. 



G^ x (z) 



J -^—^dnx{t) for z <£ supp/ix, 



is the Cauchy transform of fix, is its functional inverse so that 1/ ± oo stands for 

0. 

Theorem 2.2 (Norm of the eigenvector projection). Consider io G {1, . . . , r} such that 
l/9i € (G flx (a~),G[ lx (b + )). For each n, define 



\ io {X n ) if9 io >0, 



(x n ) ife io <o, 



and let u be a unit-norm eigenvector of X n associated with the eigenvalue Aj . Then we 
have, as n — > oo, 



(a) 



\(u,ker(e i0 In-Pn))\ 2 - 



where p = G~ x (l/6i ) is the limit of Aj ; 
(b) 

(u,® t ^ kei(9J n -P n )) ^0. 

Theorem 2.3 (Eigenvector phase transition). When r = 1, let the sole non-zero eigen- 
value of P n be denoted by 9. Suppose that 

! ;G^(6 + ) = -oo if9>0, 







G'^a-) = -oo t/0<O. 
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For each n, let u be a unit-norm eigenvector of X n associated with either the largest or 
smallest eigenvalue depending on whether 9 > or 9 < 0, respectively. Then we have 

(u,ker(9I n -P n )) ^>0, 

as n — > oo. 

The following proposition allows to assert that in many classical matrix models, such 
as Wigner or Wishart matrices, the above phase transitions actually occur with a finite 
threshold. The proposition is phrased in terms of b, the supremum of the support of 
Px, but also applies for a, the infimum of the support of px- The proof relies on a 
straightforward computation which we omit. 

Proposition 2.4 (Square-root decay at edge and phase transitions). Assume that the 
limiting eigenvalue distribution \xx has a density f fJiX with a power decay at b, i.e., that, 
as t — > b with t < b, ffi x (t) ~ c(b — t) a for some exponent a > —1 and some constant c. 
Then: 

G^ x {b + ) < oo -<=>- a > and G'^ x {b + ) = — oo -<=>- a < 1, 

so that the phase transitions in Theorems \2.1\ and \2.3\ manifest for a = 1/2. 

Remark 2.5 (Necessity of eigenvalue repulsion for the eigenvector phase transition). 
Under additional hypotheses on the manner in which the empirical eigenvalue distribution 
of X n fix as n — > oo, Theorem 12.21 can be generalized to any eigenvalue with limit 
p equal either to a or b such that G'^ x (p) is finite. In the same way, Theorem 12.31 can 
be generalized for any value of r. The specific hypothesis has to do with requiring the 
spacings between the Aj(X n )'s to be more "random matrix like" and exhibit repulsion 
instead of being "independent sample like" with possible clumping. We plan to develop 
this line of inquiry in a separate paper. 

2.4. Extreme eigenvalues and eigenvectors under multiplicative perturbations. 

We maintain the same hypotheses as before so that the limiting probability measure px, 
the index s and the rank r matrix P n are defined as in Section I2TT1 In addition, we assume 
that for every n, X n is a non-negative definite matrix and that the limiting probability 
measure px is not the Dirac mass at zero. 

Consider the rank r multiplicative perturbation of the random matrix X n given by 

X n = X n x (/„ + P n ). 

Theorem 2.6 (Eigenvalue phase transition). The extreme eigenvalues of X n exhibit the 
following behavior as n — > oo. We have that for 1 < i < s, 

fc(iM) ife i >i/T, x (b+), 

\b otherwise, 
while for each fixed i > s, \i(X n ) -^V b. 
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In the same way, for the smallest eigenvalues, for each < j < r — s, 

(T-i(l/ej) if9 j <l/T flx ( a -), 

^n-r+j(X n ) — — > < 

la otherwise, 

while for each fixed j > r — s, \ n -j(X n ) — ^ a. 
Here, 

T^ x (z) = J -^—^d/ixit) for z <£ supp/ix, 
is the T-transform of fix, T~ x (-) is its functional inverse and 1/ ± oo stands for 0. 

Theorem 2.7 (Norm of eigenvector projection). Consider i G {l,...,r} such that 
l/0 io G (T Atx (a~),T Alx (6 + )). For each n, define 



A; 



\ io (X n ) if9 io >0, 



(x n ) ife io <o, 



and let u be a unit-norm eigenvector of X n associated with the eigenvalue Aj . Then we 
have, as n — > oo, 

a) 

|<tZ,kBT(Vn " Pn)}\ 2 -'■ 



lor "/ix^ 1 

where p = T~^(l/9i ) is the limit of Aj ; 
b) 

(u,® j ^ kex{9 j I n -P n )) ^>0. 

Theorem 2.8 (Eigenvector phase transition). VF/ien r = 1, lei the sole non-zero eigen- 
value of P n be denoted by 9. Suppose that 



£ £ ( T »x( a ), T » X ( b+ )), and 



T'(b + ) = -oo z/^>0 ; 



r; x (a-) = -oo */0<o. 



For each n, let u be the unit-norm eigenvector of X n associated with either the largest or 
smallest eigenvalue depending on whether 9 > or 9 < 0, respectively. Then, we have 

(u,ker(0I n -P n )) ^VO 

as n — > oo. 

Proposition 2.9 (Square-root decay at edge and phase transitions). Assume that the 
limiting eigenvalue distribution fix has a density f flx with a power decay at b (or a or 
both), i.e., that, as t — > b with t < b, f^ x {t) ~ c(b — t) a for some exponent a > — 1 and 
some constant c. Then: 

T llx {b + ) < oo -<=>- a > and T^ x (6 + ) = — oo -<=>- a < 1, 

so that the phase transitions in Theorems \2.b\ and \2.8\ manifest for a = 1/2. 
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The analogue of Remark 12.51 also applies here. 

Remark 2.10 (Eigenvalues and eigenvectors of a similarity transformation of X n ). Con- 
sider the matrix S n = (I n + P n ) 1 ^ 2 X n (I n + P n ) 1 ^ 2 ■ The matrices S n and X n = X n (I n + P n ) 
are related by a similarity transformation and hence share the same eigenvalues and con- 
sequently the same limiting eigenvalue behavior in Theorem 12.61 Additionally, if Ui is a 
unit- norm eigenvector of X n then Wi = (I n + P n Y' 2 Ui is an eigenvector of S n and the 
unit-norm eigenvector V; t = Wi/\\wi\\ satisfies 

Iff kerf*-/ _ P)) p-Ml^^ 

It follows that we obtain the same phase transition behavior and that when \jQi G 
(T Mx (a-) J T Mx (6+)) J 

9- + 1 

| (vi, ker(9il n - P n ))\ 2 - 1 and (v h ker(6»;/ n - P n )) 0, 

so that the analogue of Theorems 12.71 and 12.81 for the eigenvectors of S n holds. 

2.5. The Cauchy and T transforms in free probability theory. 

2.5.1. The Cauchy transform and its relation to additive free convolution. The Cauchy 
transform of a compactly supported probability measure \i on the real line is defined as: 



f dfi(t) 

G^{z) = / tor z f. supp /x. 



If [a, b] denotes the convex hull of the support of /x, then 

G^(a~) := limGJz) and G fl (b + ) := lira G^(z) 

zfa z\b 

exist in [— oo, 0) and (0, +oo], respectively and realizes decreasing homeomorphisms 
from (— oo, a) onto (G^(a~), 0) and from (6, +oo) onto (0, Throughout this paper, 

we shall denote by the inverses of these homeomorphisms, even though can 

also define other homeomorphisms on the holes of the support of /i. 

The R-transform, defined as 

R,(z) := G-\z) - l/z, 

is the analogue of the logarithm of the Fourier transform for free additive convolution. 
The free additive convolution of probability measures on the real line is denoted by the 
symbol EB and can be characterized as follows. 

Let A n and B n be independent n x n symmetric (or Hermitian) random matrices that 
are invariant, in law, by conjugation by any orthogonal (or unitary) matrix. Suppose 
that, as n — > oo, Ha u — > Ha and Hb„ — > Hb- Then, free probability theory states that 
HA n +B n — > Ha EB Hbi a probability measure which can be characterized in terms of the 
/^-transform as 

R^ A m^ B (z) = R» A (z) + R^ B (z). 
The connection between free additive convolution and G~ x (via the i?-transform) and the 
appearance of G~ l in Theorem 12.11 could be of independent interest to free probabilists. 
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2.5.2. The T -transform and its relation to multiplicative free convolution. In the case 
where \i ^ Sq and the support of /i is contained in [0, +oo), one also defines its T -transform 




for z supp fix, 



which realizes decreasing homeomorphisms from (— oo, a) onto (T M (a~), 0) and from (b, +oo) 
onto (0,7j t (6 + )). Throughout this paper, we shall denote by T" 1 the inverses of these 
homeomorphisms, even though T M can also define other homeomorphisms on the holes of 
the support of \x. 

The S-transform, defined as 

SM :={\ + z)/{zT;\z)l 

is the analogue of the Fourier transform for free multiplicative convolution M. The free 
multiplicative convolution of two probability measures fiA and /is is denoted by the sym- 
bols Kl and can be characterized as follows. 

Let A n and B n be independent nxn symmetric (or Hermitian) positive-definite random 
matrices that are invariant, in law, by conjugation by any orthogonal (or unitary) matrix. 
Suppose that, as n — > oo, fiA n — > Ha and fiB n — > Hb- Then, free probability theory 
states that (iA„-B n — > Ha ^ Hb, a probability measure which can be characterized in 
terms of the 5"-transform as 

S^a^hb ( z ) = S ilA {z)S ilB {z). 
The connection between free multiplicative convolution and T~ l (via the S'-transform) and 
the appearance of T" 1 in Theorem 12 . 61 could be of independent interest to free probabilists. 

2.6. Extensions. 

Remark 2.11 (Phase transition in non-extreme eigenvalues). Theorem 12.11 can easily 
be adapted to describe the phase transition in the eigenvalues of X n + P n which fall 
in the "holes" of the support of fi X - Consider c < d such that almost surely, for n 
large enough, X n has no eigenvalue in the interval (c, d). It implies that G^ x induces a 
decreasing homeomorphism, that we shall denote by G fJiX ^ Ct d), from the interval (c, d) onto 
the interval (G flx (d~),G^ x (c + )). Then it can be proved that almost surely, for n large 
enough, X n + P n has no eigenvalue in the interval (c, d), except if some of the 1/^'s are 
in the interval (G^^d - ), G^ x (c + )), in which case for each such index i, one eigenvalue of 
X n + P n has limit G~^^ cd) (l/6i) as n — > oo. 

Remark 2.12 (Isolated eigenvalues of X n outside the support of fix)- Theorem 12 . 1 1 can 
also easily be adapted to the case where X n itself has isolated eigenvalues in the sense that 
some of its eigenvalues have limits out of the support of fix- More formally, let us replace 
the assumption that the smallest and largest eigenvalues of X n tend to the infimum a and 
the supremum b of the support of fix by the following one. 

There exists some real numbers 

if,..., ip + G (b, +oo) and t[, . . . , t~_ E (-oo, a) 
such that for all 1 < j < p + , 
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and for all 1 < j < p~ , 

Moreover, A 1+p +(X n ) — b and \ n -(i+p-){X n ) a. 

Then an attentive look at the proof of Theorem 12 . 1 1 shows that it still holds, in the following 
sense (we only present the point of view of the largest eigenvalues): the matrix X n still 
has eigenvalues tending to the ft's, but also, for each 1 < i < s such that 6i > 1 / 'G \ lx {b + ) , 
one eigenvalue tending to G~^(l/6i), all other largest eigenvalues tending to b. 

Remark 2.13 (Other perturbations of X n ). The previous remark forms the basis for 
an iterative application of our theorems to other perturbational models, such as X = 
y/~X(I + P)y/X + Q for example. Another way to deal with such perturbations is to 
first derive the corresponding master equations representations that describe how the 
eigenvalues and eigenvectors of X are related to the eigenvalues and eigenvectors of X and 
the perturbing matrices, along the lines of Proposition 15.11 for additive or multiplicative 
perturbations of Hermitian matrices. 

Remark 2.14 (Random matrices with Haar-like eigenvectors). Let G be an n x m Gauss- 
ian random matrix with independent real (or complex) entries that are normally dis- 
tributed with mean and variance 1. Then the matrix X = GG*/m is orthogonally (or 
unitarily) invariant. Hence one can choose an orthonormal basis (U\, . . . , U n ) of eigen- 
vectors of X such that the matrix U with columns Ui, . . . , U n is Haar-distributed. When 
G is a Gaussian-like matrix, in the sense that its entries are i.i.d. with mean zero and 
variance one, then upon placing adequate restrictions on the higher order moments, for 
non-random unit norm vector x n , the vector U*x n will be close to uniformly distributed 
on the unit real (or complex) sphere [221 EH 122] • Since our proofs rely heavily on the 
properties of unit norm vectors uniformly distributed on the n-sphere, they could possibly 
be adapted to the setting where the unit norm vectors are close to uniformly distributed. 

Remark 2.15 (Setting where eigenvalues of P n are not fixed). Suppose that P n is a ran- 
dom matrix independent of X n , with exactly r non-zero eigenvalues given by 6^, . . . , 9i n \ 
Let 6^ Qi as n — > oo. Using [221 Cor. 6.3.8] as in Section 15.2.31 one can easily see 
that our results will also apply in this case. 

The analogues of Remarks 12. 11| l2~T2| 12.141 and 12. 151 for the multiplicative setting also hold 
here. In particular, Wishart matrices with c > 1 (cf Section 13. 2j) gives an illustration of 
the case where there is a hole in the support of fix- 

3. Examples 

We now illustrate our results with some concrete computations. The key to applying 
our results lies in being able to compute the Cauchy or T transforms of the probability 
measure fix and their associated functional inverses. In what follows, we focus on set- 
tings where the transforms and their inverses can be expressed in closed form. In settings 
where the transforms are algebraic so that they can be represented as solutions of poly- 
nomial equations, the techniques and software developed in [22] can be utilized. In more 
complicated settings, one will have to resort to numerical techniques. 
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3.1. Additive perturbation of a Gaussian Wigner matrix. Let X n be an n x n 

symmetric (or Hermitian) matrix with independent, zero mean, normally distributed en- 
tries with variance a 2 /n on the diagonal and a 2 / (2n) on the off diagonal. It is known that 
the spectral measure of X n converges almost surely to the famous semi-circle distribution 
with density 



2 



y/ '4<7 2 — X 

dnx{x) = — — t. dx for x G [— 2a, 2a]. 

2a re 

It is known that the extreme eigenvalues converge almost surely to the endpoints of the 
support [I]. Associated with the spectral measure, we have 



Gfi x (z) = - Sgn ^y 2 for z G (-oo, -2a) U (2a, +oo) 



G„ x (±2a) = ±a and G£(l/0) =0 + $. 

Thus for a P n with r non-zero eigenvalues Q\ > ■ ■ ■ > 6 S > > 6 S+1 > • ■ ■ > 9 r , by 
Theorem 12.14 we have for 1 < i < s, 

p n ) ^ { : 1 97 [ [ 9 ' > ° 

otherwise, 




as n — > oo. This result has already been established in [18] for the symmetric case and in 
[31] for the Hermitian case. Remark 12. 141 explains why our results should hold for Wigner 
matrices of the sort considered in [311 [18] . 



In the setting where r = 1 and P = 9 uu*, let u be a unit-norm eigenvector of X n + P n 
associated with its largest eigenvalue. By Theorems 12.21 and 12.31 we have 



\(u,u)f 



i ii6>a, 

ae<a. 



3.2. Multiplicative perturbation of a random Wishart matrix. Let G„ beannxm 
real (or complex) matrix with independent, zero mean, normally distributed entries with 
variance 1. Let X n = G n G* n /m. It is known [2S] that, as n,m — > oo with njm — > c > 
0, the spectral measure of X n converges almost surely to the famous Marcenko-Pastur 
distribution with density 

d/lx(x) ■= ( b ~ X )( X ~ + max f°) 1 - ^0, 

where a = (1 — a/c) 2 and b = (1 + y/c) 2 . It is known [3] that the extreme eigenvalues 
converge almost surely to the endpoints of this support. 

Associated with this spectral measure, we have 



1 — sgn(z — a) y/ \z — a)(z — b) 



2c 

x _ (z + l)(cz + l) 
I f l x { z ) - 
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When c > 1, there is an atom at zero so that the smallest eigenvalue of X n is identically 
zero. For simplicity, let us consider the setting when c < 1 so that the extreme eigenvalues 
of X n converge almost surely to a and b. Thus for P n with r non-zero eigenvalues Q\ > 
■ ■ ■ > 6 S > > 9 s+ i > ■ • • > 9 r , with li := $i + 1, for c < 1, by Theorem 12.61 we have for 
1< i < s, 



\i{X n {I n + P n )) 




h-i 



if \k - 1| > y/c 

otherwise, 



as n — y oo. An analogous result for the smallest eigenvalue may be similarly derived by 
making the appropriate substitution for a in Theorem 12.61 Consider the matrix S n = (I n + 
Pn) 1 / 2 X n (I n + P„) 1 / 2 . The matrix S n may be interpreted as a Wishart distributed sample 
covariance matrix with "spiked" covariance I n + P n . By Remark I2.10[ the above result 
applies for the eigenvalues of S n as well. This result for the largest eigenvalue of spiked 
sample covariance matrices was established in jU [30] and for the extreme eigenvalues in 
0- 

In the setting where r = 1 and P = 6uu*, let / = 9 + 1 and let u be a unit-norm 
eigenvector of X n (I + P n ) associated with its largest (or smallest, depending on whether 
I > 1 or I < 1) eigenvalue. By Theorem 12.81 we have 

(j-i) 2 



} |2 ±±> I (J-l)[c(J+l)+I-l] lf \ l l \ ~ >/c, 

if |Z — 1| < -v/c- 



Let v be a unit eigenvector of S n = (7 n + P n ) 1 / 2 X n (/ n + P n ) 1 / 2 associated with its largest 
(or smallest, depending on whether / > 1 or / < 1) eigenvalue. Then, by Theorem 12.81 and 
Remark 12. 10^ we have 



\(v,u)\ 



2 a - s -. 



if \l~l\> y/c, 
if|/-l|<v^. 



The result has been established in [30] for the eigenvector associated with the largest 
eigenvalue. We generalize it to the eigenvector associated with the smallest one. 

We note that symmetry considerations imply that when X is a Wigner matrix then —X 
is a Wigner matrix as well. Thus an analytical characterization of the largest eigenvalue 
of a Wigner matrix directly yields a characterization of the smallest eigenvalue as well. 
This trick cannot be applied for Wishart matrices since Wishart matrices do not exhibit 
the symmetries of Wigner matrices. Consequently, the smallest and largest eigenvalues 
and their associated eigenvectors of Wishart matrices have to be treated separately. Our 
results facilitate such a characterization. 



4. Outline of the proofs 

We now provide an outline of the proofs. We focus on Theorems 12.11 12T21 and |2~3"| which 
describe the phase transition in the extreme eigenvalues and associated eigenvectors of 
X+P (the index n in X n and P n has been suppressed for brevity). An analogous argument 
applies for the multiplicative perturbation setting. 
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Consider the setting where r = 1, so that P = 6uu*, with u being a unit norm column 
vector. Since either X or P is assumed to be invariant, in law, under orthogonal (or uni- 
tary) conjugation, one can, without loss of generality, suppose that X = diag(Ai, . . . , X n ) 
and that u is uniformly distributed on the unit n-sphere. 



4.1. Largest eigenvalue phase transition. The eigenvalues of X + P are the solutions 
of the equation 

det(zl - (X + P)) = 0. 
Equivalently, for z so that zl — X is invertible, we have 

zi-(x + p) = (zi - x) ■ (i - (zi - xy l p), 

so that 

det(zJ - (X + P)) = det(zl - X) ■ det(J - (zi - X) -1 P). 

Consequently, a simple argument reveals that the z is an eigenvalue of X + P and not 
an eigenvalue of X if and only if 1 is an eigenvalue of the matrix {z I — X)~ l P. But 
(zi — X)~ 1 P = (zi — X)~ l 9uu* has rank one, so its only non-zero eigenvalue will equal 
its trace, which in turn is equal to OG^z), where \x n is a "weighted" spectral measure of 
X, defined by 

n 

fi n = \uk\ 2 8\ k (the Mfc's are the coordinates of u). 

k=l 

Thus any z outside the spectrum of X is an eigenvalue of X + P if and only if 



UK? „ , , 1 



:GU^) = - (1) 



z-X k ' 9 

k=l 

Equation flTJ describes the relationship between the eigenvalues of X + P and the eigen- 
values of X and the dependence on the coordinates of the vector u (via the measure 

This is where randomization simplifies analysis. Since u is a random vector with uniform 
distribution on the unit n-sphere, we have that for large n, \uk\ 2 ~ - with high proba- 
bility. Consequently, we have /i n ~ /ix so that G^z) w G Mx (z). Inverting equation ^ 
after substituting these approximations yields the location of the largest eigenvalue to be 
G~i(l/0) as in Theorem EH 

The phase transition for the extreme eigenvalues emerges because under our assumption 
that the limiting probability measure fix is compactly supported on [a, b], the Cauchy 
transform G fJLX is defined outside [a, b] and unlike what happens for G^ n , we do not 
always have G /lx (b + ) = +oo. Consequently, when 1/6 < G ftx (6 + ), we have that Xi(X) ~ 
G~ x (l/6) as before. However, when 1/0 > G flx (b + ), the phase transition manifests and 

X 1 (X) w X ± (X) = b. 

An extension of these arguments for fixed r > 1 yields the general result and constitutes 
the most transparent justification, as sought by the authors in jl], for the emergence of 
this phase transition phenomenon in such perturbed random matrix models. We rely on 
concentration inequalities to make the arguments rigorous. 
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4.2. Eigenvectors phase transition. Let u be a unit eigenvector of X + P associated 
with the eigenvalue z that satisfies ([T]). From the relationship (X + P)w = zu, we deduce 
that, for P = 9uu*, 

(zl — X)u = Pu = 6uu*u = {6u*u).u (because u*u is a scalar), 

implying that u is proportional to (zl — X)^ 1 u. 

Since u has unit-norm, 

(zI-X)-^ 
y/u*(zl - X)~ 2 u 

and 

,<.« - w - w - "z^x - fif - wM- (3) 

Equation (j2J) describes the relationship between the eigenvectors of X + P and the eigen- 
values of X and the dependence on the coordinates of the vector u (via the measure 

Here too, randomization simplifies analysis since for large n, we have \i n ~ px and 
z ~ p. Consequently, 

(*-t) 2 ~J (p-ty " ^ lPj ' 

so that when 1/9 < G^ x (b + ), which implies that p > b, we have 

whereas when 1/9 > G /lx (b + ) and G Mx has infinite derivative at p = 6, we have 

(u,ker(0/-P)) ^> 0. 



An extension of these arguments for fixed r > 1 yields the general result and brings 
into focus the connection between the eigenvalue phase transition and the associated 
eigenvector phase transition. As before, concentration inequalities allow us to make these 
arguments rigorous. 



5. The exact master equations for the perturbed eigenvalues and 

eigenvectors 

In this section, we provide the r-dimensional analogues of the master equations ([T]) and 
(|21) employed in our outline of the proof. 

Proposition 5.1. Let us fix some positive integers 1 < r < n. Let X n = diag(Ai, . . . , X n ) 
be a diagonal n x n matrix and P n = U n ^ r QU* r , with = diag(#i, . . . ,9 r ) an r x r 
diagonal matrix and U n>r an n x r matrix with orthonormal columns, i.e., U* r U n ^ r = I r ). 

a) Then any z {Ai, . . . , A n } is an eigenvalue of X n := X n + P n if and only if the r x r 
matrix 

M n (z) := I r - U* tr (zl n - X n )- x U n ^ (4) 
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is singular. In this case, 

dimker(z/ n — X n ) = dimker M n (z) (5) 
and for all x G ker(zl n — X n ), we have U* r x G ker M n (z) and 

x = (zi n -x n )- 1 u nir eu*, r x. (6) 

b) Let u^j denote the (fc, l) — th element of the n x r matrix U n>r for k = 1, . . . , n and 
I = 1, . . . , r. Then for all i,j = 1, . . . , r, the (i,j)-th entry of the matrix I r — U* r (zl n — 
X ra ) _1 f/ n)T .6 can be expressed as 

h =J -e 3 G „(*), (7) 

where fif 1 ) is the complex measure defined by 

n 
k=l 

and G o) is the Cauchy transform of f/ff 1 ) . 

c) In the setting where X = X n x (J n + P n ) and P n = U n ^ r QU* r as before, we obtain 
the analog of a) by replacing every occurrence, in @ and (jH]), of (zl n — X n )~ l with 
(zl n — X n )~ 1 X n . We obtain the analog of b) by replacing the Cauchy transform in 
with the T -transform. 

Proof. Part a) is proved, for example, in [2J Th. 2.3]. Part b) follows from a straightfor- 
ward computation of the (i, j)-th entry of U* r (zl n — X n ) _1 [/ n]r 0. Part c) can be proved 
in the same way. □ 



6. Proof of Theorem 12.11 

The sequence of steps described below yields the desired proof: 

(1) The first, rather trivial, step in the proof of Theorem 12 . 1 1 is to use Weyl's interlacing 
inequalities to prove that any extreme eigenvalue of X n which does not tend to a 
limit in M\[a, b] tends either to a or b. 

(2) Then, we utilize the "master equations" of Section [5]to express the extreme eigen- 
values of X n as the z's such that a certain random r x r matrix M n (z) is singular. 

(3) We then exploit convergence properties of certain analytical functions (derived in 
the appendix) to prove that almost surely, M n [z) converges to a certain diagonal 
matrix Mq (z), uniformly in z. 

(4) We then invoke a continuity lemma (see Lemma [6.11 - derived next) to claim that 
almost surely, the z's such that M n (z) is singular (i.e. the extreme eigenvalues of 
X n ) converge to the z's such that Mc^ x (z) is singular. 

(5) We conclude the proof by noting that, for our setting, the z's such that Mc f , x (z) 
is singular are precisely the z's such that for some i G {1, . . . ,r}, G flx (z) = j-. 
Part (ii) of Lemma IfTT] about the rank of M n (z), will be useful to assert that when 
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the #;'s are pairwise distinct, the multiplicities of the isolated eigenvalues are all 
equal to one. 

6.1. A continuity lemma for the zeros of certain analytic functions. We now 

prove a continuity lemma that will be used in the proof of Theorem 12.11 We note that 
nothing in its hypotheses is random. As hinted earlier, we will invoke it to localize the 
extreme eigenvalues of X n . 

For z G C and E a closed subset of R, set d(z, E) = min x6 .E \z — x\. 

Lemma 6.1. Let us fix a positive integer r, a family . . . ,9 r of pairwise distinct nonzero 
real numbers, two real numbers a < b, an analytic function G(z) of the variable z G C\[a, b] 
such that 

a) G(z) G R zeR, 

b) for all z G R\[a,b], G'(z) < 0, 

c) G(z) — > as \z\ — > oo. 

Let us define, for z G C\[a, b], the r x r matrix 

M G {z) = diag(l - 6 1 G(z), . . . , 1 - 6 r G(z)), (8) 

and denote by z\ > ■ ■ ■ > z p the z 's such that Mq{z) is singular, where p G {0, . . . , r} is 
identically equal to the number of i 's such that G(a~) < 1/0, < G(b + ). 

Let us also consider two sequences a n , b n with respective limits a, b and, for each n, a 
function M n (z), defined on z G C\[a n , b n ], with values in the set ofrxr complex matrices 
such that the entries of M n (z) are analytic functions of z. We suppose that 

d) for all n, for all z G C\R, the matrix M n (z) is invertible, 

e) M n (z) converges, as n — > oo, to the function M G (z), uniformly on {z G C ; d(z, [a, b]) > 
r]}, for all r) > 0. 

Then 

(i) there exists p real sequences z Uj i > . . . > z ntP converging respectively to z\, . . . , z p 
such that for any e > small enough, for n large enough, the z 's in M\[a — e, b + e] 
such that M n (z) is singular are exactly z ni i, . . . , z n>p , 

(ii) for n large enough, for each i, M n (z n ^) has rank r — 1. 

Proof. Note firstly that the z's such that Mq(z) is singular are the z's such that for a 
certain j G {1, . . . , r}, 

1 - 9 3 G{z) = 0. (9) 

Since the 0j's are pairwise distinct, for any z, there cannot exist more than one j G 
{1, . . . ,r} such that holds. As a consequence, for all z, the rank of Mg(z) is either 
r or r — 1. Since the set of matrices with rank at least r — 1 is open in the set of r x r 
matrices, once (i) will be proved, (ii) will follow. 
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Let us now prove (i). Note firstly that by c), there exists R > max{|a|, \b\} such that 
for z such that \z\ > R, \G(z)\ < mini ^n- F° r an Y such z, \ det Mg(z)\ > 2~ r . By e), it 
follows that for n large enough, the z's such that M n (z) is singular satisfy \z\ > R. By 
d), it even follows that the z's such that M n (z) is singular satisfy z G [— R, R]. 

Now, to prove (i), it suffices to prove that for all c,d<E K\([a, b] U {zi, . . . , z p }) such 
that c < d < a or b < c < d, we have that as n — > oo: 

(H) the number of z's in (c,d) such that det M n (z) = 0, denoted by Card C) d(n) tends 
to Card Cj( i, the cardinality of the z's in {1, . . . ,p} such that c < Zi < d. 

To prove (H), by additivity, one can suppose that c and d are close enough to have 
Carded = or 1 . Let us define 7 to be the circle with diameter [c, d] . By a) and since 
c,d £ {z±, . . . , z p }, det Mg(-) does not vanish on 7, thus 

Card, ' / ^det AfcW 1 f d. det M„(z) 

' 2in 7 7 detM G (2) n^oo2m &etM n (z) 

the last equality following from e). It follows that for n large enough, Card Ci d(n) = Card c ,<i 
(note that since Carded = or 1, no ambiguity due to the orders of the zeros has to be 
taken into account here). □ 

6.2. Proof of Theorem 

6.2.1. First step: consequences of Weyl's interlacing inequalities. Note that Weyl's inter- 
lacing inequalities imply that for all 1 < i < n, 

\+(r-s)(X n ) < \(X n ) < Xi_ s (X n ), (10) 

where we employ the convention that Xk{X n ) = — oo is k > n and +oo if k < 0. It follows 
that the empirical spectral measure of X n pix because the empirical spectral measure 
of X n does as well. 

Since a and b belong to the support of fix, we have, for alH > 1 fixed, 

liminf Xi(X n ) > b and limsuplimsup \ n+ i-i(X n ) < a. 

By the hypotheses that 

\i(X n ) -^-4 b and X n (X n ) a, 

it follows that for alH > 1 fixed, Xi(X n ) -— > b and X n+ \-i{X n ) a. 

By ffTUl) . we deduce both following relation (ITTj) and ffT2^) : for alH > 1 fixed, we have 

liminf Aj(X n ) > 6 and limsup X n+ i-i(X n ) < a (11) 

n— >-oo n— >oo 

and for alH > s (resp. i > r — s) fixed, we have 

\(X n ) ^ b (resp. X n _ t (X n ) ^ a). (12) 

Equations fill I) and (1121) are the first step in the proof of Theorem 12.11 They state that 
any extreme eigenvalue of X n which does not tend to a limit in R\[a, b] tends either to a 
or b. Let us now prove the crux of the theorem related to the isolated eigenvalues. 



18 FLORENT BENAYCH-GEORGES AND RAJ RAO NADAKUDITI 

6.2.2. Isolated eigenvalues in the setting where 9x,...,9 r are pairwise distinct. In this 
section, we assume that the eigenvalues 9i,...,9 r of the perturbing matrix P n to be 
pairwise distinct. In the next section, we shall remove this hypothesis by an approximation 
process. 

For a momentarily fixed n, let the eigenvalues of X n be denoted by Ai > ... > A n . 
Consider orthogonal (or unitary) n x n matrices U x , Up that diagonalize X n and P n , 
respectively, such that 

X n = U x diag(Ai, . . . , X n )U* x , P n = U P diag(0i, . . . , 9 r , 0, . . . , 0)U* P . 

The spectrum of X n + P n is identical to the spectrum of the matrix 

diag(Ai, . . . , A„) + U* X U P ding(9 1 ,...,9 r ,0,...,0)U* P U x . (13) 

denoted by U n 

Since we have assumed that X n or P n is orthogonally (or unitarily) invariant and that 
they are independent, this implies that U n is a Haar-distributed orthogonal (or unitary) 
matrix that is also independent of (Ai, . . . , A n ) (see the first paragraph of the proof of [21], 
Th. 4.3.5] for additional details). 

Recall that the largest eigenvalue Xi(X n ) — — > b, while the smallest eigenvalue X n (X n ) 
a. Let us now consider the eigenvalues of X n which are out of [A n (X re ), Xi(X n )]. By Propo- 
sition 15.11 a) and an application of the identity in Proposition 15.11 b) these eigenvalues are 
precisely the numbers z [X n (X n ), Xi(X n )] such that the r x r matrix 

M n {z):=I r -\e 3 G^{z)]l J=1 (14) 



1,3 



is singular. Recall that in (I14p . G ( n )(z), for i, j = 1, . . . , r is the Cauchy transform of the 
random complex measure defined by 

n 

(n) \ ^ (n) On) r /i c \ 

Ad Z^ U k,i U k,j 5 Xk{Xn), (15) 
m=l 

where i and Ufa j circ the (k,i)— th and (k,j)-th entries of the orthogonal (or unitary) 
matrix U n in f TTH]) and A& is the fc-th largest eigenvalue of X n as in the first term in ([TBI . 

By Proposition 19.31 we have that 

( n ) &. S) ) Vx for i = j, ^ 
l ' 3 )5 otherwise. 

Thus we have 

M n (z) ^> M G ^ (z) := diag(l - 9 1 G tlx (z), . . . , 1 - 9 r G» x {z)), (17) 



and by Lemma [9.11 this convergence is uniform on {z G C ; d(z, [a, b]) > rj > 0} for each 
r] > 0. 



We now note that Hypotheses a), b) and c) of Lemma \Q. II are satisfied and follow from 
the definition of the Cauchy transform G flx . Hypothesis d) of Lemma 16.11 follows from 
the fact that X n is Hermitian while hypothesis e) has been established in (fT7j) . 
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Let us recall that the eigenvalues of X n which are out of [X n (X n ), Xi(X n )] are precisely 
those values z n where the matrix M n (z n ) is singular. As a consequence, we are now in a 
position where Theorem 12.11 follows by invoking Lemma [6. II Indeed, by Lemma l6.1[ if 

Z\ > ■ ■ ■ > z p 

denote the solutions of the equation 



J(l-9 l G, x (z)) = (zeR\[a,b}), 



i=i 



then their exists some sequences (z ni i),. . . , (z ntP ) converging respectively to z±, . . . , z p such 
that for any e > small enough, for n large enough, the eigenvalues of X n that are out of 
[a — £, b + e] are exactly z n> i, . . . , z n)P . Moreover, §5§ and Lemma IBTTT -fii) ensure that for 
n large enough, these eigenvalues have multiplicity one. 

6.2.3. Isolated eigenvalues in the setting where some 9i 's may coincide. We now treat the 
case where the 9^s are not supposed to be pairwise distinct. 

We denote, for 9 ^ 0, 

'G£(l/0) ifG Mx (a-)<l/e<G Mx (6+), 

<b if i/e>G m {b+), 

if 1/9 <G, lx (a-). 

a.s. 



We want to prove that for all 1 < i < s, Xi(X n ) —4 pe i and that for all < j < r — s, 
X n -j(X n ) —4- pe r _ y 

We shall treat only the case of largest eigenvalues (the case of smallest ones can be 
treated in the same way). So let us fix 1 < i < s and s > 0. 

There is 77 > such that \pg — pg. \ < e whenever \9 — 9i\ < 77. Consider pairwise distinct 
non zero real numbers 9[ > ■ ■ ■ > 9' r such that for all j = 1, . . . , r, 9j and 6j have the same 
sign and 

r 

]T(^-%) 2 <min(77V 2 ). 

It implies that \pg{ — pej ^ £■ With the notation in Section [6.2.21 for each n, we define 

Pt l = U P dmg(9[,...,9' r ,0,...,0)U P . 
Note that by [221 Cor. 6.3.8], we have, for all n, 

n r 

X>j(X„ + P' n ) - X,(X n + P n )f < 1r{P' n - P n f = - 9 3 f < e\ 

j=i j=i 

Theorem 12.11 can applied to X n + P' n (because the 9[, . . . , 9' r are pairwise distinct). It 
follows that almost surely, for n large enough, 

\Xi(X n + P' n ) - pe> \ < e. 



20 FLORENT BENAYCH-GEORGES AND RAJ RAO NADAKUDITI 

By the triangular inequality, almost surely, for n large enough, 

\Xi{X n + P n )- p et \ <3e, 
so that Xi(X n + P n ) p 8t . □ 

7. Proof of Theorem 12.21 

As in Section I6.2.2[ let 

X n = U x diag(A 1 , . . . , \ n )U* x , P n = U P diag(^, . . . , 9 r , 0, . . . , 0)U* P . 

The eigenvectors of X n + P n , are precisely Ux times the eigenvectors of 
diag(A 1; . . . , X n ) + U X U P diag(^, . . . , 9 r , 0, . . . , 0)U P U x . 

Consequently, we have proved Theorem 12.21 by proving the result in the setting where 
X n = diag(Ai, . . . , A n ) and P n = U n diag(#i, . . . , 9 r , 0, . . . , 0)U*, where U n is a Haar- 
distributed orthogonal (or unitary) matrix. 

As before, we denote the entries of U n by [u[j ]^=i- Let the columns of U n be denoted 
by ui, . . . ,u n and the n x r matrix whose columns are respectively u±, . . . , u r by U nr . 

Note that the itj's, as u, depend on n, hence should be denoted for example by uf^ and 
u( n >. The same, of course, is true for the Aj's and the Aj's. To simplify our notation, we 
shall suppress the n in the superscript. 

Let tq be the number of i's such that 9$ = 9i . Up to a reindexing of the 9iS (which 
are then no longer decreasing - this fact does not affect our proof), one can suppose that 
z'o = 1, 9\ = ■ • • = 9 ro . This choice implies that, for each n, ker(9iI n — P n ) is the linear span 
of the tq first columns ui, . . . , u ro of U n . By construction, these columns are orthonormal. 



Hence, we will have proved Theorem 12.21 if we can prove that as n — > oo, 

ro _-, 1 

Ell ■ — A I 2 a - s - 
9*G> (p) = ^T 

i=l »o fx^W C/ io J 



'it) J (p~t) 2 

and 

r 

^ |( Mj ,«)| 2 ^0. (19) 

i=ra+l 

As before, for every n and for all z outside the spectrum of X n , define the r x r random 
matrix: 



M n (z) :=I r -[9jG w {z)] r itj=1 , 

where, for all i, j — 1, . . . , r, /4j is the random complex measure defined by (1151) . 
In (fT7|) we established that: 

M n (-) ^ M Gmx (■) := diag(l - 0iG^(-), . . . , 1 - 9 r G /JiX (•)) (20) 
uniformly on G C ; d(z, [a, b]) > rj} for all r/ > 0. 
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We have established in Theorem 12.11 that because 9 io > l/G Atx (6 + ), \ io p = 
G~^(l/O io ) ^ [a, b] as n — > oo. It follows that: 

M n (z n ) ^> diag(0, . . . , 0, 1 - %±i, . . . , 1 - (21] 



ro zeros 



Proposition l5.1l a) states that for n large enough so that such that X io is not an eigenvalue 
of X n , the r x 1 vector 

~{u>i,u)~ 
U*, r u = : 

is in the kernel of the r x r matrix M n (z n ) with ||t/* r n||2 < 1. 

Thus by (1201) . any limit point of U* r u is in the kernel of the matrix on the right hand 
side of fl2"T|) . i.e. has its r — ro last coordinates equal to zero. 

Thus ffT9l) holds and we have proved Theorem l2.2l -b). We now establish (fLSl) . 

By ([HD, one has that for all n, the eigenvector u of X n associated with the eigenvalue 
Aj can be expressed as: 

u = (A io J n - A^) -1 ^ diag(6 l i, . . . , r )U*yu 



(\ io I n - X n ) 1 ^ 9 j M ) 



ro r 
= (A io J n - Xn)' 1 ^29 j (u j ,u)Uj + (Aj J n - Xn)' 1 ^ °J^'j- <')"j ■ 
j=l j=n+l 

" * " ' " " ' 

denoted by u' denoted by u" 

As Aj p £ [a, 6], the sequence (Aj J n — Xn)" 1 is bounded in operator norm so that by 
( TT9|) . 0. Since ||w|| = 1, this implies that \\u'\\ — ^ 1. 

Since we assumed that #j = ^ = ■ ■ • = 9 ro , we must have that: 

ri 

ll^'ll 2 = C V (M i5 u) ( Uj , u) u*(z n I n - X n )-\ . (22) 

* ' 

By Proposition I9.3j we have that for all i ^ j, fi\j 5 while for all /4™ fix- 
Thus, since we have that z n p ^ [a, 6], we have that for all i, j = 1, . . . , r , 

d /4?(*) a.s, / d/i(t) 



(^.-t) 2 J 7 (p-t) 2 ' 

Combining the relationship in ( l2"2"j) with the fact that \\u'\\ — 1, yields (fTSl) and we 
have proved Theorem 12.21 -a) . □ 
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8. Proof of Theorem 12.31 

Let us assume that 9 > 0. The proof supplied below can be easily ported to the setting 
where 9 < 0. 

We first note that the G'^ x {b + ) = -oo, implies that / = -G hlx {b + ) = +oo. We 

adopt the strategy employed in proving Theorem 12.2} and note that it suffices to prove 
the result in the setting where X n = diag(Ai, . . . , A n ) and P n = 9uu*, where u is a n x 1 
column vector uniformly distributed on the unit real (or complex) n-sphere. 

We denote the coordinates of u by ?4 , . . . and define, for each n, the random 
probability measure 



k=l 

The r = 1 setting of Proposition 15.11 b) states that the eigenvalues of X n + P n which 
are not eigenvalue of X n are the solutions of 

G ®W = \- 

Since G^ n )(z) decreases from +oo to for increasing values of z G (Ai, +oo), we have 

that Xi(X n + P n ) =: Ai > Ai. Reproducing the arguments leading to ([3]) in Section (14.21) . 
yields the relationship: 

\{uMr(0In-Pn))\ 2 = j W(t y (23) 

J (Xi-t) 2 



Thus, proving that: 



(Ai - ty 



+00 as n — > 00 



will yield the desired result. 
By hypothesis, we have that 

n ^ 



n 

a.s. 

, Ai > HX- 

n 

i=l 



By Theorem 12. 1[ we have that Ai —4- b so that 



1 n 



a.s. 



k=l 

Hence, by ([25]), 

n 

»|2 X _ a.s, 

k=l 

This implies that almost surely, 



~(n) I (™) 1 2 r a - s -> 



Uminf / & = liminf [f^l > / = +00 , 



n— >oo 



(Ai-t) 2 ™ 7 (6-t) 2 7 (b-t) 
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so that by (123|) . (u, ker(#I„ — P n ))\ 2 thereby proving Theorem 12.31 
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□ 



We omit the details of the proofs of Theorems I2.6H2.8[ since these are straightforward 
adaptations of the proofs of Theorems l2.11 Theorem 12.31 that can obtained by following 
the prescription in Proposition 15. li e). 

9. Appendix: convergence of weighted spectral measures 

9.1. A few facts about the weak convergence of complex measures. Recall that a 
sequence (jj, n ) of complex measures on R is said to converge weakly to a complex measure 
yU on K if, for any continuous bounded function / on R, 

/(t)<W*) — ► f f(t)dfi(t). (24) 

We now establish a lemma on the weak convergence of complex measures that will be 
useful in proving Proposition 19.31 We note that the counterpart of this lemma for proba- 
bility measures is well known. We did not find any reference in standard literature to the 
"complex measures version" stated next, so we provide a short proof. 

Recall that a sequence (/i n ) of complex measures on IR is said to be tight if 

lim sup |/i„|({t G E ; \t\ > R}) = 0. 

R— H-oo n 

Lemma 9.1. Let D be a dense subset of the set of continuous functions on K. tending to 
zero at infinity endowed with the topology of the uniform convergence. Consider a tight 
sequence (jj, n ) of complex measures on R such that (|// n |(R)) is bounded and (|24j) holds 
for any function f in D. Then (//„) converges weakly to \i. Moreover, the convergence of 
( j2l]) is uniform on any set of uniformly bounded and uniformly Lipschitz functions. 



Proof. Firstly, note that using the boundedness of (|/i n |(R)), one can easily extend 
to any continuous function tending to zero at infinity. It follows that for any continuous 
bounded function / and any continuous function g tending to zero at infinity, we have 



lim sup 



/d(/x - Hn) 



<sup / |/(l-0)|d(M + H), 



which can be made arbitrarily small by appropriately choosing g. The tightness hypothesis 
ensures that such a g can always be found. This proves that (/x n ) converges weakly to fi. 
The uniform convergence follows from a straightforward application of Ascoli's Theorem. 
□ 

9.2. Convergence of weighted spectral measures. 

Lemma 9.2. For each n, let = (u±, . . . , u n ), v^ n > = (v i, . . . , v n ) be the first two rows 
of a Haar distributed random orthogonal (or unitary) matrix. Let = (x±, . . . ,x n ) 
be a set of real-valued random number^ independent of {u^ n \v^). We suppose that 
sup n k < oo almost surely. 



1 Note that for each k, Uk, Vk and xu obviously depend on n. We could have represented this dependence 
in the subscript as u n ^ or u^L but we choose to suppress the n for notational brevity. 
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a) Then as n — > oo, 

UiViXt + u 2 v 2 x 2 H h u n v n x n 0. 



b) Suppose that -(x\ + x 2 + ■ ■ ■ + x n ) converges almost surely to a deterministic limit 
I. Then 

I 1 ,i 1 9 i I 1 2 a - s - j 
\U\\ Xl + \U 2 \ X 2 H h | tt n | ^ t- 

Proof. We prove a) by conditioning on the values of the x's. As a consequence, we can 
suppose the x^'s to be non-random. Let us define the random variable Z n = JiiViXi + 
• • • + u n v n x n . To establish a), it suffices to prove that E[Z^} = 0{n~ 2 ). We have 

n 

E[Z*] = ^2 XiXjXkXiEluiUjUkUiViVjVkVi]. 

i,j,k,l=l 

Since (u±, . . . ,u n ) and (vi, . . . ,v n ) are the first two rows of a Haar distributed random 
orthogonal (or unitary) matrix, we have that E[uiUjUkUiViVjVkv{\ = whenever one of the 
indices k, I, is different from the others [16]. It follows that 



VIZ*} < 3^^E[ U ^] 



h3 

< 3^x^ 2 (EK]EK 8 ]EK 8 ]E^])^. 
a y ' 

=E[u?l 



Since, by [TB], E[uf] = 0(n 4 ) and we have assumed that sup nfc \xf.\ < oo, u±ViXi + • ■ • + 



To prove b), we employ a different strategy. In the setting where (u\, . . . ,u n ) is the 
first row of a Haar distributed orthogonal matrix, the result follows from the application 
of a well-known concentration of measure result [251 Th. 2.3 and Prop. 1.8] which states 
that there are positive constants C, c such that for n large enough, for any 1-Lipschitz 
function f n on the unit sphere of MJ 1 , for all e > 0, 

P {!/>!, . . . ,u n ) - E[f n ( Ul , . . .,u n )}\ >e}< Ce- cn£ \ 

This implies that if E[f n (ui, . . . ,u n )] converges, as n — > oo, to a finite limit, then 
f n (ui, . . . , u n ) converges almost surely to the same limit. In the complex setting, we 
note that a uniformly distributed random vector on the unity sphere of C n is a uniformly 
distributed random vector on the unit sphere of 1R 2 ™ so that we have proved the results 
in the unitary setting as well. □ 

Proposition 9.3. Let, for each n, = (u\, . . . , u n ), i/ n ' = (vi, . . . ,v n ) be the two 
first columns of a uniform random orthogonal (resp. unitary) matrix. Let also = 
(Ai,...,A n ) be a random family of real number^ independent of (u^ n \v^) such that 
almost surely, sup nfc |Afc| < oo. We suppose that there exists a deterministic probability 
measure \i onM. such that almost surely, as n — > 0, ^ Y^k=i ^A fe converges weakly to \i. 



2 As in Lemma |9.2[ we have suppressed the index n for notational brevity. 
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Then as n — y oo, 

/%(») := Y^k=i \ u k\ 2 b\ k converges almost surely weakly to \i, (25) 
/%(») '■= Ylk=i^kVk^x k converges almost surely weakly to 0. (26) 

Proof. We use Lemma 19.11 Note first that almost surely, since sup nfc |Afc| < oo, both 
sequences are tight. Moreover, we have 

n 
k=l 

thus, by the Cauchy-Schwartz inequality, |/%(«) V (n)\{R) < 1. The set of continuous func- 
tions on the real line tending to zero at infinity admits a countable dense subset, so it 
suffices to prove that for any fixed such function /, the convergences of fl25|) and fl26l) hold 
almost surely when applied to /. This follows easily from Lemma [9.21 □ 
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